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• Premise of the study: Fourteen genomic microsatellite markers were developed and characterized in honey locust, Gleditsia 
triacanthos, using Illumina sequencing. Due to their high variability, these markers can be applied in analyses of genetic diver- 
sity and structure, and in mating system and gene flow studies. 

• Methods and Results: Thirty-six individuals from across the species range were included in a genetic diversity analysis and 
yielded three to 20 alleles per locus. Observed heterozygosity and expected heterozygosity ranged from 0.214 to 0.944 and 
from 0.400 to 0.934, respectively, with minimal occurrence of null alleles. Regular segregation of maternal alleles was ob- 
served at seven loci and moderate segregation distortion at four of 1 1 loci that were heterozygous in the seed parent. 

• Conclusions: Honey locust is an important agroforestry tree capable of very fast growth and tolerance of poor site conditions. 
This is the first report of genomic microsatellites for this species. 
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Honey locust {Gleditsia triacanthos L.), a common leguminous 
tree native to the eastern and central United States, occurs on rich 
bottomlands and rocky upland slopes and is a frequent invader of 
abandoned fields (Schnabel et al., 1998). It is used in land reclama- 
tion efforts due to its fast growth and tolerance of poor site condi- 
tions (Preston and Braham, 2002). Honey locust populations are 
characterized by wide genetic variation in adaptive traits such as 
winter hardiness in northern races and more nutritious fruits in the 
south. Application of molecular genetic techniques such as marker- 
assisted selection can be used in enhancing germplasm selection 
efficiently compared to traditional breeding procedures. Develop- 
ment of genetic resources in this important species will aid studies 
of genetic diversity among its populations, and will help identify 
genes underlying desirable traits of interest in developing sustain- 
able management strategies for this important but underutilized 
species. Identification of genetic diversity within and among re- 
gions in honey locust through assessment of genetic variation in 

Manuscript received 6 June 2013; revision accepted 23 July 2013. 

The authors thank Nick Wheeler for his guidance and contributions, and 
Richard Cronn, Brian Knaus, and Kimberly Hansen for assistance with 
Illumina library construction and sequencing. This research was supported 
by the National Science Foundation Plant Genome Research Program 
(grant no. TRPGRA2 IOS-1025974). Additional funding was provided by 
the Ecosystem Science Center and the Biotech Research Center of Michigan 
Technological University. 

9 Author for correspondence: ogailing@mtu.edu 

doi:10.3732/apps.l300050 



different environments is also important for efficient selection of 
genotypes for land restoration purposes. 

To date, no microsatellite resources have been developed in 
honey locust for characterizing its genetic resources. Gene-based 
microsatellite markers (expressed sequence tag-simple sequence 
repeats [EST-SSRs]) developed for related species such as Medi- 
cago truncatula Gaertn., Ceratonia siliqua L., and Copaifera offi- 
cinalis (Jacq.) L. show low transferability and are not polymorphic 
in honey locust (data not shown). Next-generation sequencing is 
now frequently used for easy and rapid development of microsatel- 
lite markers across many different taxa (Jennings et al., 2011), re- 
ducing time and costs for sample processing and sequencing. Using 
low-coverage, paired-end Illumina genome sequencing of G. tria- 
canthos, we characterized 14 nuclear microsatellite markers and 
assessed their variability in 36 samples from a provenance trial. 

METHODS AND RESULTS 

Low-coverage whole genome sequencing (Jennings et al., 2011) was used to 
produce an initial set of genomic resources for 10 hardwood tree species, including 
honey locust (Staton et al., in prep.). Illumina libraries were created from sonicated 
genomic DNA extracted from leaflets of one individual (seed parent of 88 single-tree 
progeny, Appendix 1) of G. triacanthos per the manufacturer's protocol (QIAGEN 
DNeasy96 Plant Kit; QIAGEN, Hilden, Germany). Libraries were constructed us- 
ing Illumina TruSeq version 2 index sequencing adapters (Illumina, San Diego, 
California, USA), and then pooled in an equimolar mixture and sequenced using 
101-bp paired-end chemistry on an Illumina HiSeq 2000 at the Oregon State Uni- 
versity Center for Gene Research and Biocomputing. After sequencing, reads for 
individual libraries were sorted by index. Because the input DNA was sheared to a 
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Table 1. Characteristics of 14 novel genomic micro satellite markers developed in Gleditsia triacanthos: 



Locus 




Primer sequences (5'-3') b 


Amplicon size (bp) 


Repeat motif 


T a (°C) 


Size range (bp) 


CjLI 002 


F : 
R: 


NE D - T AAAAAGT AAC C T T AAAGG 
AGTAAAGAGGTAACGATTT 


104 


(AT) 9 


56 


103-147 


CjLI 02 1 


F : 
R: 


6-FAM-ATATCACCAATTTAAGACC 
GTACACAAAACTTCGAGAG 


100 


(AG) n 


56 


94-98 


CjLI 026 


F : 
R: 


VIC- AAGC T T GAT T AG AG AAAT T 
AGATAGTTCCTTTCAGTTG 


127 


(AT) 14 


56 


113-143 


CjI 1057 


F : 
R: 


PET-CAGGTAAAACATGAGATTGATGC 
TTCCATAAAATCAGTCATGCAA 


121 


(TA) 9 


56 


127-157 


CjI 1063 


F : 
R: 


NED-CTCTTGCGCACACTAAAACG 
CGTACGGTGACACTTGTGC 


116 


(AC) 12 


56 


147-187 


OTT073 


F : 
R: 


VIC-CATGATTTAGAGAGAGAAATGTTTTGG 
AACCAAGCCCTTCATTTATGG 


109 


(GA) 13 


56 


134-164 


OTT114 


F : 
R: 


6-FAM-TCAAGCTAGTTAGCCTTCCTGC 
AAATATGGGAGCAATGAACC 


121 


(TC) 20 


56 


1 m 1 o o 

102-138 


OTT116 


F : 
R: 


NED-CTAAAGCTTGACTTCTGAATCC 
CGCTATATCGGAATCCCTGC 


134 


(CT) 8 


56 


131-143 


GTT117 


F : 
R: 


PET-GGTGGTATGTGCAAGCAAGC 
CTTGAGCCACCCATTACCC 


120 


(TA) 8 


56 


110-122 


GTT118 


F: 
R: 


6-FAM-CAGTCCCACCTTCACTAGCC 
TGCGTGTAATCTGAGCTTGG 


119 


(CT) 8 


56 


110-130 


GTT126 


F: 
R: 


PET-TGGATTAAGTTGTAAAGCGAGG 
CCGTCAAACTTAAGACCCACC 


109 


(AT) 8 


56 


98-146 


GTT131 


F: 
R: 


6-FAM-CTTTGAACTCTAATACTCTGGTTGC 
TCAACCACCTTAAGACATCCC 


100 


(AC) 9 


56 


91-153 


GTT132 


F: 
R: 


VIC-CAGTCCTCATGTCTAGTCTAGTGC 
CAATCTCTGGTGCAAGATGC 


105 


(AT) n 


56 


90-130 


GLT4027 


F: 
R: 


6-FAM-AGGAATTATTCTCTACCAA 
CGAATCTCATTTTATACAA 


107 


(TCCA) 6 


56 


91-107 



Note: T a = annealing temperature. 

a Geographic coordinates for the provenances are given in Appendix 1 . 
b The fluorescent label is shown with the forward primer. 



modal length of -160 bp, paired sequences were joined into overlapping extended 
contigs using FLASH (Magoc and Salzberg, 2011) with default settings. Using an 
input of 14,888,028 paired-end sequences, FLASH constmcted 13,775,803 contigs 
that ranged in size from 93 to 191 bp. An SSR finder script (Staton et al., in prep.) 
for di-, tri-, and tetranucleotide repeats identified 61,086 microsatellite motifs. Mi- 
crosatellites were defined as a 2-bp motif repeated eight to 40 times, a 3 -bp motif 
repeated seven to 30 times, or a 4-bp motif repeated six to 20 times. Using the pro- 
gram CAP3 (Huang and Madan, 1999), redundant sequences were filtered from the 
SSR-containing sequences (identity >95%), leaving only putatively unique loci. 
The filtered reads were assessed with Primer3 (Rozen and Skaletsky, 2000) to iden- 
tify primers using default program settings with slight modifications: melting tem- 
perature = 54°C minimum, 58°C optimum, and 62°C maximum; amplicon size = 
100 bp minimum, 200 bp maximum; and primer length = 17 bp minimum, 19 bp 
optimum, and 25 bp maximum. A total of 4715 primer pairs flanking microsatellite 
motifs were identified (4084 di-, 544 tri-, and 87 tetranucleotide motifs). The ge- 
nomic SSR data are publicly available through the National Center for Biotechnol- 
ogy Information (NCBI) Short Read Archive. 

Amplification and polymorphism of primers for 108 dinucleotide and 36 
tetranucleotide repeat motifs were assessed in a panel of seven unrelated 
individuals (seed parent and six potential pollen donors of the 88 single-tree pro- 
geny; Appendix 1) after electrophoretic separation on the QIAxcel Fast Analysis 
System using the QIAxcel DNA High Resolution Kit for microsatellite analysis 
(QIAGEN). Polymorphic loci were amplified in 36 samples from a provenance 
experiment (Kellogg Forest, Michigan, 28 provenances, latitudinal range: 
30°11 , N-42°45 , N, longitudinal range: 76°19 , W-106°37 , W; Appendix 2) and 
in 88 single-tree progeny using fluorescent-labeled forward primers (6-FAM, 
PET, NED, and VIC). Amplification products were separated on an ABI Prism 
Genetic Analyzer 3730 (Applied Biosystems, Foster City, California, USA) and 
scored with GeneMapper version 4.0 (Applied Biosystems). PCRs were per- 
formed in a 15-jiL reaction mix that contained 3 uL of 5x HOT FIREPol Blend 
Master Mix Ready to Load (contains 10 mM MgCl 2 , 0.6 units of HOT FIREPol 
Taq polymerase, and 2 mM dNTPs; Solis BioDyne, Tartu, Estonia), 2 jiL each 
of 5 jiM fluorescent-labeled forward (Applied Biosystems) and reverse primers 
(Sigma-Aldrich, St. Louis, Missouri, USA), 6 jllL double deionized water 
(DNase- and RNase-free), and 2 JiL DNA (-1.8 ng/jiL). Amplification was 
carried out in a Peltier Thermal Cycler (GeneAmp PCR system 2700, Applied 



Biosystems). The PCR profile was as follows: 15 min denaturation at 95 °C, 
followed by 35 cycles of 45 s denaturation at 94° C, a 45 s annealing step at the 
annealing temperature (Table 1), a 45 s elongation at 72°C, and a final extension 
step at 72°C for 20 min. Observed (H Q ) and expected (H e ) heterozygosities 
(Nei, 1973) and number of alleles (A) were calculated in GENEPOP version 
4.0.10 (Raymond and Rousset, 1995). Pairwise linkage disequilibrium for all 
loci was also calculated in GENEPOP. 

All 144 primer pairs amplified products in the expected size range and 14 
were polymorphic in the set of seven unrelated individuals after electropho- 
retic separation on the QIAxcel Fast Analysis System (QIAGEN). Using the 
diversity panel of 36 individuals from the species distribution range (Appendix 2), 



Table 2. Genetic properties of the 14 novel microsatellite markers in 
Gleditsia triacanthos. 



Locus 


A 




He 


GLT002 


19 


0.886 


0.914 


GLT021* 


3 


0.333 


0.549 


GLT026* 


11 


0.735 


0.864 


GTT057* 


11 


0.778 


0.829 


GTT063* 


12 


0.667 


0.793 


GTT073* 


12 


0.944 


0.825 


GTT114* 


17 


0.889 


0.877 


GTT116 


6 


0.214 


0.716 


GTT117* 


6 


0.257 


0.400 


GTT118* 


7 


0.639 


0.705 


GTT126 


20 


0.853 


0.934 


GTT131* 


9 


0.778 


0.821 


GTT132* 


17 


0.639 


0.895 


GLT4027* 


5 


0.676 


0.729 



Note: A = number of alleles; H Q = expected heterozygosity; H Q = observed 
heterozygosity. 

* Primers characterized in the seed parent and 88 progeny. 
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the 14 micro satellite markers showed relatively high levels of polymorphism 
with number of alleles per locus ranging between three and 20 (Table 2). 
Genomic microsatellites are associated with high levels of polymorphism due 
to their occurrence in the less conserved untranscribed regions of DNA. H Q 
ranged from 0.214 to 0.944 and H e from 0.400 to 0.934 (Table 2). H Q and H e 
were similar in the samples for each locus, except for GTT116 (Table 2), 
which had a high number of missing data, indicating low incidence of null 
alleles for most of the 14 micro satellite markers. No significant linkage dis- 
equilibrium was detected between markers (P < 0.05) after Bonferroni correc- 
tion. Out of the 14 loci, 1 1 were heterozygous in the seed parent of 88 progeny, 
and regular segregation of the maternal alleles was assessed in the progeny 
using a % 2 test. Segregation distortion was observed for GLT026 and GTT131 
(P < 0.05), and for GTT117 and GTT4027 (P < 0.01). Distorted segregation 
of alleles could be attributed to a variety of both genetic and physiological 
factors, including pollen-tube competition, pollen lethals, preferential fertil- 
ization, and elimination of zygotes (Lu et al., 2002). 

CONCLUSIONS 

This is the first report of genomic microsatellites for G. 
triacanthos. The high levels of polymorphism at the 14 loci 
are especially useful for gene flow and mating system analy- 
ses in a species that is functionally dioecious. The micro- 
satellites will also facilitate the study of the effect of isolation 
and fragmentation on genetic variation and structure in G. 
triacanthos populations. 
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Appendix 1 . Geographic coordinates of Gleditsia triacanthos seed parent 
(784) and potential pollen parents (Butternut Valley, Memphis, 
Tennessee, USA) used for micro satellite marker screening. 



Accession no. Locality Latitude Longitude 



784* DeKalb, Tennessee, USA 35°54'46.607"N 85°54'32.083"W 

780 DeKalb, Tennessee, USA 35°54 , 47.090 ,, N 85°54'31.678"W 

781 DeKalb, Tennessee, USA 35°54'47.090"N 85°54'31.678"W 

782 DeKalb, Tennessee, USA 35°54 , 47.256 ,, N 85°54'32.114"W 

783 DeKalb, Tennessee, USA 35°54'46.697"N 85°54'31.244"W 

785 DeKalb, Tennessee, USA 35°54 , 51.779 ,, N 85°54 , 29.778 ,, W 

786 DeKalb, Tennessee, USA 35°54 , 50.392 ,, N 85°54'30.755"W 



* Individual used to make the Illumina library. 



Appendix 2. Geographic coordinates of Gleditsia triacanthos provenances 
from a provenance experiment, Kellogg Forest, Michigan, USA. 



Accession no. 


Locality 


Latitude 


Longitude 


60 


Bulloch, Georgia, USA 


32°28'N 


8i°46'w 


90 


Franklin, Kentucky, USA 


38°10'N 


84°52'W 


91 


Franklin, Kentucky, USA 


38°10'N 


84°52'W 


109 


Hickman, Kentucky, USA 


36°45'N 


89°05'W 


110 


Hickman, Kentucky, USA 


36°45'N 


89°05'W 


154 


Washington, Texas, USA 


3o°ii'n 


96°37'W 


160 


Washington, D.C., USA 


38°55'N 


77°00'W 


161 


Lancaster, Pennsylvania, USA 


4o°oi'n 


76°19'W 


166 


Huntington, Pennsylvania, USA 


40°22'N 


77°54'W 


169 


East Carroll, Louisiana, USA 


32°47'N 


91°10'W 


218 


Chester, South Carolina, USA 


34°43'N 


8i°i3'w 


234 


Aiken, South Carolina, USA 


33°34'N 


81°43'W 


247 


Franklin, Kansas, USA 


37°45'N 


95°10'W 


258 


Bienville, Louisiana, USA 


32°32'N 


92°55'W 


261 


Bienville, Louisiana, USA 


32°32'N 


92°55'W 


278 


Warren, Ohio, USA 


39°25'N 


84°12'W 


281 


Delaware, Ohio, USA 


40°22'N 


82°57'W 


300 


Fairfax, Virginia, USA 


38°51'N 


77°19'W 


305 


Ogle, Illinois, USA 


42°01'N 


89°20'W 


327 


Monroe, Arkansas, USA 


34°40'N 


91°19'W 


337 


Texas, Missouri, USA 


37°31'N 


91°50'W 


340 


Phelps, Missouri, USA 


37°55'N 


91°55'W 


341 


Phelps, Missouri, USA 


37°55'N 


91°55'W 


342 


Larimer, Colorado, USA 


40°34'N 


105°04'W 


366 


Story, Iowa, USA 


42°01'N 


93°32'W 


367 


Story, Iowa, USA 


42°01'N 


93°32'W 


370 


Erie, Ohio, USA 


41°27'N 


82°42'W 


387 


Burke, North Carolina, USA 


35°45'N 


81°46'W 


422 


Bernalillo, New Mexico 


35°04'N 


106°37'W 


444 


Polk, Iowa, USA 


41°43'N 


93°35'W 


445 


Polk, Iowa, USA 


41°43'N 


93°35'W 


447 


Ingham, Michigan, USA 


42°45'N 


84°30'W 


457 


Monongalia, West Virginia, USA 


39°37'N 


79°57'W 


461 


Piatt, Illinois, USA 


39°47'N 


88°37'W 


465 


Piatt, Illinois, USA 


39°47'N 


88°37'W 
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